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Abstract 



We present a new formalization of the spine change of measure approach for branching 
diffusions that improves on the scheme laid out for branching Brownian motion in Kyprianou 
[28], which itself made use of earlier works of Lyons et al [31, 30, 27]. We use our new 
formulation to interpret certain 'Gibbs-Boltzmann' weightings of particles and use this 
to give a new, intuitive and proof of a more general 'Many-to-One' result which enables 
expectations of sums over particles in the branching diffusion to be calculated purely in 
terms of an expectation of one particle. Significantly, our formalization has provided the 
foundations that facilitate a variety of new, greatly simplified and more intuitive proofs in 
branching diffusions: see, for example, the C convergence of additive martingales in Hardy 
and Harris [19], the path large deviation results for branching Brownian motion in Hardy 
and Harris [18] and the large deviations for a continuous-typed branching diff'usion in Git 
et al [15] and Hardy and Harris [17]. 

1 Introduction 

One of the central elements of the spine approach is to interpret the behaviour of a branching 
process under a certain change of measure. Such an interpretation was first laid out by Chauvin 
and Rouault [7] in the case of branching Brownian motion, and we first briefly review the main 
ideas on a heuristic level. 

Consider a branching Brownian motion (BBM) with constant breeding rate r, that is, a 
branching process whereby particles diffuse independently according to a Brownian motion and 
at any moment undergo fission at a rate r to produce two particles that each evolve independently 
from their birth position, and so on. We suppose that the probabilities of this process are given 
by {P=^ : X e M}, where is a measure defined on the natural filtration {!Ft)t>o such that 
it is the law of the process initiated from a single particle positioned at x. Suppose that the 
configuration of this branching Brownian motion at time t is given by the M- valued point process 
Xt := {^Xu{t) : u £ Nt] where Nt is the set of individuals alive at time t. It is well known that 
for any A G M, 



defines a strictly-positive P-martingale, so Zx{oo) := Hmt^ao is almost surely finite under 

P^. The important contribution of Chauvin and Rouault [7] was to determine a pathwise 
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construction of the measure where 
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with the term ^a(O) a-cts as a normahzing factor. 

The measure defined at (2) is equivalent to the following pathwise construction: 

• starting from position a;, the original ancestor diffuses according to a Brownian motion on 
R with drift A; 

• at an accelerated rate 2r the particle undergoes fission producing two particles; 

• with equal probability, one of these two particles is selected; 

• this chosen particle repeats stochastically the behaviour of the parent; 

• the other particle initiates, from its birth position, an independent copy of a P branching 
Brownian motion with branching rate r. 

The chosen line of descent in such pathwise constructions of the measure, here Qa, has come 
to be known as the spine as it can be thought of as the backbone of the branching process Xt 
from which all particles are born. Although Chauvin and Rouault's work on the measure change 
continued in a paper co-authored with Wakolbinger [8], where the new measure is interpreted 
as the result of building a conditioned tree using the concepts of Palm measures, it wasn't until 
the so-called 'conceptual proofs' of Lyons, Kurtz, Peres and Pemantle published around 1995 
([31, 30, 27]) that the spine approach really began to crystalize. These papers laid out a formal 
basis for spines using a series of new measures on two underlying spaces of sample trees with 
and without distinguished lines of descent (the spine). Of particular interest is the paper by 
Lyons [30] which gave a spine-based proof of the /^^-convergence of the well-known martingale 
for the Galton- Watson process. Here we first saw the spine decomposition of the martingale as 
the key to using the intuition provided by Chauvin and Rouault's pathwise construction of the 
new measure - Lyons used this together with a previously known measure-theoretic result on 
Radon-Nikodym derivatives that allows us to deduce the behaviour of the change-of-measure 
martingale under the original measure by investigating its behaviour under the second measure. 
Similar ideas have recently been used by Kyprianou [2S] to investigate the £^ -convergence of 
the BBM martingale (1), by Biggins and Kyprianou [2] for multi-type branching processes in 
discrete time, by Geiger [11, 12] for Galton- Watson processes, by Georgii and Baake [14] to study 
ancestral type behaviour in a continuous time branching Markov chain, as well as Olofsson [33]. 
Also see Athreya [1], Geiger [9, 10, 13], Iksanov [23], Rouault and Liu [29] and Waymire and 
Williams [34], to name just a few other papers where spine and size-biasing techniques have 
already proved extremely useful in branching process situations. 

In this article we present a new formalization of the spine approach that improves the schemes 
originally laid out by Lyons et al [31, 30, 27] and later for BBM in Kyprianou [2iS]. Although the 
set-up costs of our spine formalization are quite large, at least in terms of definitions and notation, 
the underlying ideas are all extremely simple and intuitive. The power of this approach should 
not be underestimated. The further techniques developed subsequently in Hardy and Harris 
[19, 18, 17], Git et. al. [15] and J.W.Harris & S.C.Harris [20] manage to completely bypass 
many previous technical problems and difficult non-linear calculations, with spine calculations 
facilitating the reduction to relatively straightforward classical one-particle situations; this paper 
serves as a foundation for these other works. 

In the first instance our improvements correct a perceived weakness in the Lyons et al scheme 
where one of the measures they defined had a time-dependent mass and could therefore not be 
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normalized to be a probability measure and lacked a clear interpretation in terms of any direct 
process construction; an immediate consequence of this improvement is that here all measure 
changes are carried out by martingales and we regain a clear intuitive construction. 

Another difference is in our use of filtrations and sub-filtrations, where Lyons et al instead 
used marginalizing. As we shall show, this brings substantial benefits since it allows us to relate 
the spine and the branching diffusion through the conditional-expectation operation, and in 
this way gives us a proper methodology for building new martingales for the branching diffusion 
based on known martingales for the spine. In other work we have used this powerful construction 
to obtain martingales for large-deviations problems, including a neat proof of a large deviations 
principle for branching Brownian motion in [ ], and to study a BBM with quadratic branching 
rate in [2U]. 

The conditional-expectation approach also leads directly to new and simple proofs of some 
key results for branching diffusions. The first of these concerns the relation that becomes clear 
between the spine and the 'Gibbs-Boltzmann' weightings for the branching particles. Such 
weightings are well known in the theory of branching process, and Harris [22] contains some 
analysis for a model of a typed branching diffusion that is similar in spirit to the models we 
shall be considering here. In our formulation these important weightings can be interpreted as a 
conditional expectation of a spine event, and we can use them to immediately obtain a new and 
very useful interpretation of the additive operations previously seen only within the context of 
the Kestcn-Stigum theorem and related problems. Another application of our approach we give 
a substantially easier proof of a far more general Many-to-One theorem that is so often useful 
in branching processes applications; for example, in Champneys et al [3] or Harris and Williams 
[21] it was a key tool in their more classical approaches to branching diffusions. 

The layout of this paper is as follows. In Section 2, we will introduce the branching models; 
describing a binary branching multi-type BBM that we will frequently use as an example, before 
describing a more general branching Markov process model with random family sizes. In Section 
3, we introduce the spine of the branching process as a distinguished infinite line of descent 
starting at the initial ancestor, we describe the underlying space for the branching Markov 
process with spine and we also introduce various fundamental filtrations. In Section 4, we 
define some fundamental probability spaces, including a probability measure for the branching 
process with a randomly chosen spine. In Section 5, various martingales are introduced and 
discussed. In particular, we see how to use filtrations and conditional expectation to build 
'additive' martingales for the branching process out of the product of three simpler 'one-particle' 
martingales that only depend on the behaviour along the path of the spine; used as changes of 
measure, one martingale will increase the fission rate along the path of the spine, another will 
size-bias the offspring distribution along the spine, whilst the other one will change the motion 
of the spine. Section 6 discusses changes of measure with these martingales and gives very 
important and useful intuitive constructions for the branching process with spine under both 
the original measure P and the changed measure Q. Another extremely useful tool in the spine 
approach is the spine decomposition that we prove in Section 7; this gives an expression for 
the expectation of the 'additive' martingale under the new measure Q conditional on knowing 
the behaviour all along the path of the spine (including the spine's motion, the times of fission 
along the spine and number of offspring at each of the spine's fissions). Finally, in Section 8, 
we use the spine formulation to derive an interpretation for certain Gibbs-Boltzmann weights 
of Q, discussing links with theorems of Kesten-Stigum and Watanabe, in addition to proving 
a very general 'Many-to-One' theorem that enables expectations of sums over particles in the 
branching process to be calculated as expectation only involving the spine. 
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2 Branching Markov models 



Before we present the underlying constructions for spines, it will be useful to give the reader an 
idea of the branching-diffusion models that we have in mind for applications. We first discuss a 
finite-type branching diffusion, and then present a more general model that shall be used as the 
basis of our spine constructions in the following sections. 

2.1 A finite-type branching diff"usion 

Suppose that for a fixed n e N we are given two sets of positive constants a(l), . . . , a{n) and 
r(l), . . . , r(n). Consider a typed branching diffusion in which the type of each particle moves as 
a finite, irreducible and time-reversible Markov chain on the set / := {f, ... ,71} with Q-matrix 
6Q (6 is a strictly positive constant that could be considered as the temperature of the system) 
and invariant measure tt = {7r(f), . . . ,7r(7i)}. The spatial movement of a particle of type y is a 
driftless Brownian motion with instantaneous variance a{y), so that if (^Xu{t), Yu{t)) G M x / is 
the space-type location of individual u at time t then we have 

dX4t)^aiYuit))dBt 

for a Brownian motion Bt. Fission of a particle of type y occurs at a rate r{y) to produce two 
particles at the same space-type location as the parent. 

We define J := R x /, and suppose that the configuration of this whole branching diffusion 
at time t is given by the J-valued point process = {(^^(t), Yu{t)^ : u G Nt} where Nt is the 
set of individuals alive at time t. Let the measures {P'^ y : {x,y) e on the filtered space 
(O, Too, {^t)t>o) be such that under P^'^ a single initial ancestor starting at (x, y) evolves as the 
branching diffusion, Xt, as described above and where {J-t)t>o the natural filtration generated 
by the point process Xt. 

In this branching diffusion, each particle moves in a stochastically similar manner: let a 
process {£,t,ilt) on J under a measure P behaves stochastically like a single particle in the 
branching-diffusion Xf with no branching occurring. Thus, rjt is an irreducible, time-reversible 
Markov chain on / with Q-matrix dQ and invariant measure tt = {tti, . . . , 7r„}, whilst moves 
as a driftless Brownian motion and diffusion coefficient a{y) > whenever rjt is in state y: 

d6 = a{rjt)^ dBt, 

for a P-Brownian motion Bf. We note that the formal generator of this process {£,t,rit) is: 

nF{x,y) = ^aiy)^+eJ2Qiy,j)Fi^,j), (F:J~>]R). (3) 

We shall often refer to the typed diffusion (^4,774) G M x / as the single particle model, after the 
work carried out by Harris and Williams [21]. 

In Hardy [f G] and Hardy and Harris [19] this finite-type branching diffusion has been investi- 
gated, and we briefly mention that the proofs have been based on the following two martingales, 
the first based on the whole branching diffusion and the second based only on the single-particle 
model: 

Zxit) := vx{Yu{t))e^''-^'^-'''\ (4) 
ueNt 

Ut) :=e/o«(^'»)d««^(ry,)e^«'-^^*, (5) 

where v\ and E), satisfy: 

(^h''A + 0Q + Ryx=Exvx, 
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which is to say that v\ is an eigenvector of the matrix ^X^A + OQ + R, with eigenvalue E\. These 
two martingales should be compared with the corresponding martingales (1) and e^Bt-^>- t ^^j, 
BBM and a single Brownian motion respectively. 

2.2 A general branching Markov process 

The spine constructions in our formulation can be applied to a much more general branching 
Markov model, and we shall base the presentation on the following model, where particles move 
independently in a general space J as a stochastic copy of some given Markov process 5( , and 
at a location-dependent rate undergo fission to produce a location-dependent random number 
of offspring that each carry on this branching behaviour independently. 

Definition 2.1 (A general branching Markov process) We suppose that three initial ele- 
ments are given to us: 

• a Markov process St in a measurable space {J,B), 

• a measurable function i? : J — > [0, oo), 

• for each x G J we are given a random variable A{x) whose probability distribution on 
the natural numbers {0,1,...} is P[A{x) — — pk{x), and whose mean is m{x) := 

From these ingredients we can build a branching process in J according to the following recipe: 

• Each particle of the branching process will live, move and die in this space { J,B), and if an 
individual u is alive at time t we refer to its location in J as Xu{t). Therefore the time-t 
configuration of the branching process is a J-valued point process Xi {Xu{t) : u £ Nt) 
where Nt denotes the collection of all particles alive at time t. 

• For each individual u, the stochastic behaviour of its motion in J is an independent copy 
of the given process St . 

• The function i? : J ^ [0, oo) determines the rate at which each particle dies: given that u 
is alive at time t, its probability of dying in the interval -|- d£) is R{Xu{t))dt -t- o(di). 

• If a particle u dies at location x ^ J it is replaced by 1 -\- A^ particles all positioned at x, 
where A^ is an independent copy of the random variable A(x). All particles, once born, 
progress independently of each other. 

We suppose that the probabilities of this branching process are {P^ : .t G j} where under 
one initial ancestor starts out at x. 

We shall first give a formal construction of the underlying probability space, made up of the 
sample trees of the branching process Xt in which the spines are the distinguished lines of descent. 
Once built, this space will be filtered in a natural way by the underlying family relationships of 
each sample tree, the diffusing branching particles and the diffusing spine, and then in section 4 
we shall explain how we can define new probability measures that extend each up to the 
finest filtration that contains all information about the spine and the branching particles. Much 
of the notation that we use for the underlying space of trees, the filtrations and the measures is 
closely related to that found in Kyprianou [28]. 

Although we do not strive to present our spine approach in the greatest possible generality, 
our general model already covers many important situations whilst still being able to clearly 
demonstrate all the key spine ideas. In particular, in all our models, new offspring always 
inherit the position of their parent, although the same spine methods should also readily adapt 
to situations with random dispersal of offspring. 
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For greater clarity, we often use the finite- type branching diffusion of Section 2.1 to introduce 
the ideas before foUowing up with the general formulation. For example, in this finite-type model 
we would take the process St to be the single-particle process {S,t,Vt) which lives in the space 
J := R X / and has generator H given by (3). The birth rate in this model at location (x, y) € J 
will be independent of x and given by the function R{y) for all y G / and, since only binary 
branching occurs in this case, we also have P{A{x,y) = 1) = 1 for all {x,y) £ J. 

3 The underlying space for spines 
3.1 Marked Galton- Watson trees with spines 

The set of Ulam-Harris labels is to be equated with the set il of finite sequences of strictly- 
positive integers: 

n := {0}U |J(Nr, 

nGN 

where wc take N = {l,2,...}. For two words u,v G ft, uv denotes the concatenated word 
(u0 = 0u = u), and therefore Q. contains elements like '213' (or '0213'), which we read as 'the 
individual being the 3rd child of the 1st child of the 2nd child of the initial ancestor 0'. For two 
labels v,u £ ri the notation v < u means that v is an ancestor of m, and |u| denotes the length 
of u. The set of all ancestors of u is equally given by 

{w : w < u} = : 3w G such that vw = u}. 

Collections of labels, ie. subsets of fi, will therefore be groups of individuals. In particular, 
a subset r C f2 will be called a Galton- Watson tree if: 

1. e T, 

2. if u, w G ri, then uv £ t implies u € t, 

3. for all u E T, there exists Au G 0,1,2,... such that uj G r if and only if 1 < j < 1 -|- A„, 
(where j G N). 

That is just to say that a Galton- Watson tree: 

1. has a single initial ancestor 0, 

2. contains all ancestors of any of its individuals v, 

3. has the 1 + Au children of an individual u labelled in a consecutive way, 

and is therefore just what wc imagine by the picture of a family tree descending from a single 
ancestor. Note that the '1 < j < 1 + A^ condition in 3 means that each individual has at least 
one child, so that in our model wc arc insisting that Galton- Watson trees never die out. 

The set of all Galton- Watson trees will be called T. Typically we use the name r for a 
particular tree, and whenever possible we will use the letters u or u or w to refer to the labels 
in T, which we may also refer to as nodes of t or individuals in t or just as particles. 

Each individual should have a location in J at each moment of its lifetime. Since a Galton- Watson 
tree r G T in itself can express only the family structure of the individuals in our branching 
random walk, in order to give them these extra features wc suppose that each individual u G r 
has a mark {X^, cr„) associated with it which wc read as: 

• cr„ G is the lifetime of u, which determines the fission time of particle u as Su ■= 
'I2v<u '^'J (with 5*0 := (70). The times Su may also be referred to as the death times; 
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• Xu : [Su — ctutSu) — > J gives the location of u at time t G [Su — Cu, Su)- 

To avoid ambiguity, it is always necessary to decide whether a particle is in existence or not at 
its death time. 

Remark 3.1 Our convention throughout will be that a particle u dies 'just before' its death time 
Su (which explains why we have defined '■ [Su — cr„,S'„) ■ for example). Thus at the time 
Su the particle u has disappeared, replaced by its 1 + Au children which are all alive and ready 
to go. 

We denote a single marked tree by (r, X, a) or (r, M) for shorthand, and the set of all marked 
Galton- Watson trees by T: 

• T := < (r, X, cr) : T G T and for each u E t, G M+, X„ : [Su ~ c-u, Su) J 



For each (r, X, a) G T, the set of particles that arc alive at time t is defined as Nt := {u G 

T : Su — CTu < t < Su}. 

Where we want to highlight the fact that these values depend on the underlying marked tree we 
write e.g. Nt{{T,X,a)) or 5„((r,M)). 

Any particle u G t that comes into existence creates a subtree made up from the collection 
of particles (and all their marks) that have u as an ancestor - and u is the original ancestor of 
this subtree. 

• {t, X, a)J , or [t,M)^ for shorthand, is defined as the subtree growing from individual u's 
jth child uj, where 1 < j < 1 + Au. 

This subtree is a marked tree itself, but when considered as a part of the original tree we have 
to remember that it comes into existence at the space-time location {Xu{Su — (Ju)tSu — cTu) ~ 
which is just the space-time location of the death of particle u (and therefore the space-time 
location of the birth of its child uj). 

Before moving on there is a further useful extension of the notation: for any particle u we extend 
the definition of Xu from the time interval [Su — (Ju, Su) to allow all earlier times t G [0, Su). 

Definition 3.2 Each particle u is alive in the time interval [Su ~ cr„,5'„), but we extend the 
concept of its path in J to all earlier times t < Su: 



Xu{t) 



Xu{t) if Su ~ CTu <t < Su 

Xu (t) if V < u and Jy < t < Sy 



Thus particle u inherits the path of its unique line of ancestors, and this simple extension will 
allow us to later write expressions like exp{/p /(s) dXu{s)} whenever u & Nt, without worrying 
about the birth time of u. 

For any given marked tree (r, Af ) G T we can identify distinguished lines of descent from the 
initial ancestor: 0,ui, 1*2,^3, . . . G r, in which U3 is a child of U2, which itself is a child of ui 
which is a child of the original ancestor 0. We'll call such a subset of r a spine, and will refer to 
it as ^: 

• a spine ^ is a subset of nodes {0, ui, U2, U3, . . .} in the tree t that make up a unique line 
of descent. We use £,t to refer to the unique node in ^ that that is alive at time t. 

In a more formal definition, which can for example be found in the paper by Rouault and Liu 
[29], a spine is thought of as a point on dr the boundary of the tree - in fact the boundary 
is defined as the set of all infinite lines of descent. This explains the notation ^ G (3t in the 
following definition: we augment the space T of marked trees to become 
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is the set of marked trees with distinguished 

spines. 

It is natural to speak of the position of the spine at time t which think of just as the position of 
the unique node that is in the spine and ahve at time t: 

• we define the time-i position of the spine as := Xu{t)^ where it G ^ n A't. 

By using the notation to refer to both the node in the tree and that node's spatial position 
we are introducing potential ambiguity, but in practice the context will make clear which we 
intend. However, in case of needing to emphasize, we shall give the node a longer name: 

• nodet((r, M, ^)) := w if u G ^ is the node in the spine alive at time t, 

which may also be written as node((^). 

Finally, it will later be important to know how many fission times there have been in the 
spine, or what is the same, to know which generation of the family tree the node is in (where 
the original ancestor is considered to be the 0th generation) 

Definition 3.3 We define the counting function 

nt = \ nodet{(,)\, 

which tells us which generation the spine node is in, or equivalently how many fission times there 
have been on the spine. For example, if £,t = (0,1*1,^2) then both and ui have died and so 
nt = 2. 

3.2 Filtrations 

The reader who is already familiar with the Lyons et al [27, 30, 31] papers will recall that 
they used two separate underlying spaces of marked trees with and without the spines, then 
marginalized out the spine when wanting to deal only with the branching particles as a whole. 
Instead, we are going to use the single underlying space T, but define four filtrations of it that 
will encapsulate different knowledge. 

3.2.1 Filtration iTt)t>o 

We define a filtration of T made up of the cr-algebras: 

Tt := a-(^{u,Xu,a-u) : Su < t ; {u,Xu{s) : s G [5„ - crt,,i]) : t e [Su - cr„,S'„)^, 

which in words means that J-t is generated by all the information regarding the branching 
particles that have lived and died before time t (this is the condition Su < t), along with just 
the information up to time t of those particles still alive at time t (this is the t G [Su — ctu, Su) 
condition). Each of these tr-algebras will be a subset of the limit defined as 

^co :-f^(U-^*)- 

t>o 
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3.2.2 Filtration {J^t)t>o 

In order to know about the spine, we make this fihration finer, defining J-t by adding into J-t 
tlie knowledge of which node is tire spine at time t: 

J^t ;= (T(j^t,nodet(^)), ^oo := c^^lj ^t) ■ 

Consequently this filtration knows everything about the branching process and everything about 
the spine: it knows which nodes make up the spine, when they were born, when they died (ie. 
the fission times Su)^ and their family sizes. 

3.2.3 Filtration (C/t)f>o 

We define a filtration of T, {^t}j>Q, where the cr-algebras 

are generated by only the spatial motion of the spine in the J. Note that the events G ^ Qt do 
not know which nodes of the tree r actually make up the spine. 

3.2.4 Filtration (^t)t>o 

We augment Qt by adding in information on the nodes that make up the spine (as we did from 
Tt to J-t), as well as the knowledge of when the fission times occurred on the spine and how big 
the families were that were produced: 

gt := cj{gt, (node,(0 : s <t),{Au : u < ^t)), Goo ^t(|J Qt). 

3.2.5 Summary 

In brief, the key filtrations we shall make key use of are: 

• J^t knows everything that has happened to all the branching particles up to the time t, but 
does not know which one is the spine; 

• J-f knows everything that J-f knows and also knows which line of descent is the spine (it 
is in fact the finest filtration); 

• Qt knows only about the spine's motion in J up to time t, but does not actually know 
which line of descent in the family tree makes up the spine; 

• Qt knows about the spine's motion and also knows which nodes it is composed of. Further- 
more it knows about the fission times of these nodes and how many children were born at 
each time. 

We note the obvious relationships between these filtrations of T that J-t C Tt and Qt C Qt C Tt- 
Trivially, we also note that Qt ^ J-t, since the filtration J-t does not know which line of descent 
makes up the spine. 
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4 Probability measures 



Having now carefully defined the underlying space for our probabilities, we remind ourselves of 
the probability measures: 

Definition 4.1 For each x G J , let be the measure on (T, ^oo) such that the filtered prob- 
ability space {T,Too, {^t)t>Oi P ) is the canonical model for Ht, the branching Markov process 
described in Definition 2.1. 

For details of how the measures P^ are formally constructed on the underlying space of trees, 
we refer the reader to the work of Neveu [32] and Chauvin [G, 4]. Note, we could equally think 
of P^ as a measure on {T,!Foo), but it is convenient to use the enlarged sample space T for all 
our measure spaces, varying only the filtrations. 

Our spine approach relies first on building a measure P^ under which the spine is a single 
genealogical line of descent chosen from the underlying tree. If we are given a sample tree (r, M) 
for the branching process, it is easy to verify that, if at each fission we make a uniform choice 
amongst the offspring to decide which line of descent continues the spine ^, when m e r we have 

Prob(u e = n (6) 

(In the binary-branching case, for example, Proh{Ay = 1) = 1 and then Prob(it G ^) = 2^"'.) 
This simple observation is the key to our method for extending the measures, and for this we 
make use of the following representation found in Lyons [-M )] . 

Theorem 4.2 /// is a J- 1 -measurable function then we can write: 

/ = E (7) 

uGNt 

where /„ is J-f-measurable. 

As a simple example of this, in the case of the finite-typed branching diffusion of Section 2.1, 
such a representation would be: 

e/c' d^„;,(^,)e^«-^^* = e^o«(^"(«))d^«^(y„(i))e^^"W-^^*l(^^^„). (8) 



Definition 4.3 Given the measure P^ on (T, J-qo) we extend it to the probability measure 
on {T,J-ao) by defining 

/,/d^^-/,E/«^T^dP^ (9) 

for each f G mJ-t with representation like (7). 

The previous approach to spines, exemplified in Lyons [•)()], used the idea of fibres to get a 
measure analogous to our P that could measure the spine. However, a perceived weakness in 
this approach was that the corresponding measure had time-dependent total mass and could 
not be normalized to become a probability measure with an intuitive construction, unlike our 
P. Our new idea of using the down-weighting term of (6) in the definition of P is crucial in 
ensuring that we get a very natural probability measure (look ahead to Lemma 6.6), and leads 
to the very useful situation in which all measure changes in our formulation are carried out by 
martingales. 
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Theorem 4.4 This measure really is an extension of in that P = P\j^aa- 
Proof: If / e mJ^t then the representation (7) is trivial and therefore by definition 

However, it can be shown that X^mgatj IltKu i+a = 1 t)y retracing the sum back through the 
fines of ancestors to the original ancestor 0, factoring out the product terms as each generation 
is passed. Thus 

J^fdP^ l^fdP 

□ 

Definition 4.5 The filtered probability space {T , J-'ao, {J-t)t>Oi P) together with (Xt,^t) will be 
referred to as the canonical model with spines. 

In the single-particle model of section 2.1 we assumed the existence of a separate measure P 
and a process {^t,Vt) that behaved stochastically like a 'typical' particle in the typed branching 
diffusion Xj. In our formalization the spine is exactly the single-particle model: 

Definition 4.6 We define the measure P on (T,Qoo) as the projection of P: 

■=P\g.- 

Under the measure P the spine process £,t has exactly the same law as 5(. 

Definition 4.7 The filtered probability space (T, Qao, (St)t>0: P) together with the spine process 
will be referred to as the single-particle model. 

5 Martingales 

Starting with the single Markov process Sj that lives in (J, B) we have built (Xt, ^t), a branching 
Markov process with spines, in which the spine behaves stochastically like the given 5t. In this 
section we are going to show how any given martingale for the spine (^{t) leads to a corresponding 
additive martingale for the whole branching model. 

We have actually seen an example of this already. For the finite- type model of section 2.1 
we met two martingales: 

Z,{t):= ^ ^;,(y„(^))e^^"(*)-^^^ (10) 
ueNt 

Ut) :=e/o^("=)^^«A(?7t)e^^*~''^*- (H) 

Just from their very form it has always been clear that they are closely related. What we shall 
later be demonstrating in full generality in this section is that the key to their relationship comes 
through generalising the following JFt-measurable martingale for the multi-type BBM model: 

Definition 5.1 We define an J-'t -measurable martingale: 

Ut) n (1 + ^ "A('?t)e'«'-^^*. (12) 

u<it 
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An important result that we show in this article, in a more general form, is that Zx{t) and C\{t) 
are just conditional expectations of this new martingale (^x: 

We emphasize that this relationship does not appear to have previously been formalized, and 
that it is only possible because of our new approach to the definition of P as a probability 
measure, and of our using filtrations to capture the different knowledge generated by the spine 
and the branching particles. 

Furthermore, in the general form that we present below it provides a consistent methodology 
for using well-known martingales for a single process to get new additive martingales for 
the related branching process. In Hardy and Harris [fcS, 17] we use these powerful ideas to 
give substantially easier proofs of large-deviations problems in branching diffusions than have 
previously been possible. 

Suppose that ({t) is a (T, (C/t)t>o, P) -martingale, which is to say that it is a ^?t-measurable 
function that is a martingale with respect to the measure P. For example, in the case of our 
finite- type branching diffusion this could be the martingale C\{t) which is 5t-measurable since 
it refers only to the spine process (Ct,'7t). 

Definition 5.2 We shall call ({t) a single-particle martingale, since it is Qt-measurable and 
thus depends only to the spine ^. 

Any such single-particle martingale can be used to define an additive martingale for the whole 
branching process via the representation (7): 

Definition 5.3 Suppose that we can represent the martingale Q{t) as 

m = E CnWi(6=«)> (13) 

ueNt 

for Cuit) S rnJ-'t, as at (7). We can then define an J-f-measurable process Z{t) as 

Z{t) := e-/o"(^"(-))«(^"(-))d^C«W, 

u^Nt 

and refer to Z{t) as the branching-particle martingale. 

The martingale property Z(t) will be established in Lemma 5.7 after first building another 
martingale, Cl^)? from the single-particle martingale C(^)- First, for clarity, we take a moment 
to discuss this definition of the additive martingale and the terms like Cuit). 
If we return to our familiar martingales (10) and (11), it is clear that 

Cxit) = efo^^^'^)^^vx{v,)e'^'-^^' = e^o «(^"(^))'i^.,(y40)e^^^"(*)-^^* (14) 

u^Nt 

The terms of (13) could be here replaced with a more descriptive notation Cx\{^u-,yu)\(fy-, 
where 

C„(t) =CA[(^«,rj](t) :=e/o d«^;,(y„(t))e^^"(*)-i^.t, 

can be seen to essentially be a functional of the space- type path (X„(t), ^^(t)) of particle u. In 
this way the original single-particle martingale Cx would be understood as a functional of the 
space- type path (^t, 77^) of the spine itself and we could write 

CA(i) = Ca[(C,??)](0 - E CA[(^«,i^)](0 
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This is the idea behind the representation (13), and in those typical cases where the single- 
particle martingale is essentially a functional of the paths of the spine ^t, as is the case for our 
Cxit), we should just think of as being that same functional but evaluated over the path 
Xu{t) of particle u rather than the spine ^t. The representation (13) can also be used as a more 
general way of treating other martingales that perhaps are not such a simple functional of the 
spine path. 

Finally, from (14) it is clear that the additive martingale being defined by definition 5.3 is 
our familiar Zx(t): 

Z,it)^ ^ e-/o«(>'.(«))d.CA[(X„,y„)](t)= «A(K(t))e^^"(*^-^^*. 

ueNt ueNt 

Although definition 5.3 will work in general, in the main the spine approach is interested 
in martingales that can act as Radon-Nikodym derivatives between probability measures, and 
therefore we suppose from now on that (^{t) is strictly positive, and therefore that the additive 
martingale Z{t) is strictly positive. 



The work of Lyons et al [30, 27, 31], that of Chauvin and Rouault [7] and more recently of 
Kyprianou [28] suggests that when a change of measure is carried out with a branching-diffusion 
additive martingale like Z{t) it is typical to expect three changes: the spine will gain a drift, 
its fission times will be increased and the distribution of its family sizes will be size-biased. 
In section 6.1 we shall confirm this, but wc first take a separate look at the martingales that 
could perform these changes, and which we shall combine to obtain a martingale C(i) that will 
ultimately be used to change the measure P. 

Theorem 5.4 The expression 

[](l + m(e5j)e-^o™(?=W?=)d^ 

v<it 

is a P -martingale that will increase the rate at which fission times occur along the spine from 
R{it) to (l + m(6))i?(et). 



dL 



((l+m(0)fl(0) 



l^t v<(,t 

where L^"^^^^^ is the law of the Poisson (Cox) process with rate R{S,t) at time t. 
Theorem 5.5 The term 

is a P -martingale that will change the measure by size-biasing the family sizes born from the 
spine: 

(l + fc)pfe(CsJ 



if V < ^ti then Prob{Ay ~ k) 



l + ^"(esj 



The product of these two martingales with the single-particle martingale C,{t) will simultaneously 
perform the three changes mentioned above: 
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Definition 5.6 We define a Tt-measurahle martingale as 

= n li^x n(i+-(^^j)^-'"^«'"^^^^"xc(o. (15) 

Significantly, only the motion of tlie spine and the behaviour along the immediate path of the 
spine will be affected by any change of measure using this martingale. Also note, this martingale 
is the general form of Ca(0 that we defined at (12) for our finite-type model. 

The real importance of the size-biasing and fission-time-increase operations is that they 
introduce the correct terms into ({t) so that the following key relationships hold: 

Lemma 5.7 Both Z{t) and C(t) are projections of Q{t) onto their filtrations: for all t, 

. z{t)^p{at)\^t), 

Proof: We use the representation (7) of 

lit) = y: n(i+^")^^^°'"^''"'^"''^''"^^^^'^c.wi(6=«)- (16) 

From this it follows that 

P{m^t) - E e-/o™(^"M)«(^"W)ds^„(i) X J](l + A„)P(l(^,_)|.f,) 

= J2 e-/o™(^"(-))«(^"(^))d-C„(t) = Z(t), 

since |J^t) = l^ueN^) UvkuC^ + ^^v)^^ ■ 

On the other hand, the martingale terms in (15) imply 

p(c>)iet) =c(o xp(n(i+^«)e"^"'"^^=^''^^=^'ieO ^c(i), 

v<it 

□ 



6 Changing the measures 



For the finite type model, the single-particle martingale C\{t) defined at (5) can be used to define 
a new measure for the single-particle model (as in [IG]), via 



dPA 
dP 



Cxit) 



C(o) ■ 

We have now seen the close relationships between the three martingales Ca, Zx and Ca: 
Zx{t) = P{Ut) I ^t), Cx{t) = P{Ut) I Gt), 
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and in this section we show in a more general form how these close relationships mean that a 
new measure Q\ defined in terms of P as 



dP 



will induce measure changes on the sub-filtrations Qt and J-t of j^t whose Radon-Nikodym 
derivatives are given by Ca(0 ^^'^ respectively. We will also give an extremely useful 

and intuitive construction of the measures P and Q. 

We recall that in our set up we have a finest filtration {J't)t>Q associated with the measure P, 
and two sub-filtrations {J^t)t>o with measure P and {Qt)t>o with measure P. The martingale C, 
can change the measure P: 

Definition 6.1 A measure Q on {T,J-ao) is defined via its Radon-Nikodym derivative with 
respect to P: 

m 



dP 



m' 



Recall, this notation means that for each event F £ Tt we define Q{F) :~ P(lpZ{t)y 
As we did for the measures P and P in Section 4, we can restrict Q to the sub-filtrations: 
Definition 6.2 We define the 

Definition 6.3 We define the measure P on {T,Qao, iGt)t>o) via 



A consequence of our new formulation in terms of filtrations and the equalities of Lemma 
5.7 is that the changes of measure are carried out by Z{t) and C,{t) on their subfiltrations: 



Theorem 6.4 



dP 



m 

z{oy 



and 



dl 

di 



Gt 



m 
m' 



Proof: These two results actually follow from a more general observation that if fli and /i2 are 
two measures defined on a measure space (fi, S) with Radon-Nikodym derivative 



— = / 
d/ii 



(17) 



and if 5 is a sub-cr-algebra of S, then the two measures fii := fLi\s and /i2 := M2I5 on {^l,S) are 
related by the conditional expectation operation: 
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The proof of this is that if 5 G mS and S E S then 

g d/i2 ~ g d/i2 since g is also in mS, and S E S too, 

s Js 



gfdili by (17), 

s 

= / /2i(5/|5) d/ii by definition of the conditional expectation, 

Js 

= / d/ii since g is iS-measurable, 

= / 9 f^i{f\^) d^i since everything is in mS. 

Js 

Applying this general result (17) using the relationships between the general martingales given 
in Lemma 5.7 concludes the proof. □ 

6.1 Understanding the measure Q 

As the name suggests, we should be able to think of the spine as the backbone of the branching 
process. This is made precise by the following decomposition: 

Theorem 6.5 The measure P on Tt can be decomposed as: 

dP(r, = dP(OdL(«(«))(n) J] —J- n p^„(a) ndP((T,M)j), (18) 

where L'-"'^^^^^ is the law of the Poisson ( Cox) process with rate R{^t) o-t time t, and we remember 
that nt counts the number of fission times on the spine before time t. 

We can offer a clear intuitive picture of this decomposition, which we summarize in the following 
lemma. 

Lemma 6.6 The decomposition of measure P at (18) enables the following construction: 

1. the spine's motion is determined by the single-particle measure P; 

2. the spine undergoes fission at time t at rate R{S,t); 

3. at the fission time of node v on the spine, the single spine particle is replaced by 1 + Ay 
children, with Ay being chosen independently and distributed according to the location- 
dependent random variable A{^Sv) with probabilities (pki^Sv) ■ k = 0,1, . . .); 

4- the spine is chosen uniformly from the 1 + Ay children at the fission point v; 

5. each of the remaining Ay children gives rise to the independent subtrees (r, M)J, for 1 < 
j < Ay, which are not part of the spine and which are each determined by an independent 
copy of the original measure P shifted to their point and time of creation. 

This decomposition of Pt given at (18) will allow us to interpret the measure Q if we appropriately 
factor the components of the change-of-measure martingale ((t) across this representation. On 
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dQ = C(OdP 

= dP(OdL«^+™(«))^(«))(n) n n T^:^f^^i^s.)^^P{{r,M)^). (19) 

Just as we did for P, we can offer a clear interpretation of this decomposition: 
Lemma 6.7 Under the measure 

1. the spine process moves as if under the changed measure P; 

2. the fission times along the spine occur at an accelerated rate (1 + rn(S,f))R{^t)j 

3. at the fission time of node v on the spine, the single spine particle is replaced by 1 + Ay 
children, with Ay being chosen as an independent copy of the random variable A(y) which 
has the size biased offspring distribution ((1 + k)pk{y) / {I + Tn{y)) : fc = 0, 1, . . .), where 
y = G J is the spine 's location at the time of fission; 

4-. the spine is chosen uniformly from the 1 + Ay particles at the fission point v; 

5. each of the remaining Ay children gives rise to the independent subtrees {t,M)^, for 1 < 
j < Ay, which are not part of the spine and evolve as independent processes determined by 
the measure P shifted to their point and time of creation. 

Such an interpretation of the measure Q was first given by Chauvin and Rouault [7] in the context 
of BBM, allowing them to come to the important conclusion that under the new measure Q the 
branching diffusion remains largely unaffected, except that the Brownian particles of a single 
(random) line of descent in the family tree are given a changed motion, with an accelerated 
birth rate - although they did not have random family sizes, so the size-biasing aspect was 
not seen. In the context of spines, size-biasing was first introduced in the Lyons et al papers 
[30, 27, 31]. Kyprianou [28] presented the decomposition of equation (19) and the construction 
of Q at Lemma 6.7 for BBM with random family sizes, but did not follow our natural approach 
starting with the probability measure P that has subtly facilitated various benefits. 



7 The spine decomposition 

One of the most important results introduced in Lyons [M )] was the so-called spine decomposition, 
which in the case of the additive martingale 

Zx{t)^ E ^'A(r.(t))e^^""W-^^*, 

ueNt 

from the finite-type branching diffusion would be: 

QA(^A(0|eoo) - MvsJe^^'--''-^-+vxirjt)e^^^-''-'. (20) 

u<Nt 

To prove this we start by decomposing the martingale as 

Zxit) = E ^'A(K,(t))e^^"(*)-^^* + v,{vt)e'^^-''^\ 
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which is clearly true since one of the particles u £ Nt must be the in the line of descent that 
makes up the spine ^. Recalling that the cr-algebra Q^o contains all information about the line 
of nodes that makes up the spine, all about the spine diffusion (Ct,??*) for all times t, and also 
contains all information regarding the fission times and number of offspring along the spine, it 
is useful to partition the particles wG{ueiVt,u^^} into the distinct subtrees (r, M)" that 
were born at the fission times Su from the particles that made up the spine before time t, or in 
other words those nodes in the {u < ^t} of ancestors of the current spine node ^t. Thus: 

If we now take the QA-conditional expectation of this, we find 

u<(t uGWf ,tie(r,J\f)" 

We know from the decomposition (19) that the under the measure Q\ the subtrees coming off 
the spine evolve as if under the measure P, and therefore 

Qa( E «A(>;w)e^(^"(*)-«-"'-^^(*-^")|c7oo) 

veNt.ve{T,M)^ 

= P( E t'A(nW)e^(^"W-«-")-^^(*-^")| Soo) = vxivsj, 

v£Nt,v(£(T,M)^ 

since the additive expression being evaluated on the subtree is just a shifted form of the mar- 
tingale Zx itself. 

This concludes the proof of (20), but before we go move on to give a similar proof for 
the general case, for easier reference through the cumbersome-looking general proof it is worth 
recalling that 

and therefore noting that (20) can alternatively be written as 

u<Nt 

Also, in the general model we are supposing that each particle u in the spine will give birth to 
a total of Au subtrees that go off from the spine - the one remaining other offspring is used to 
continue the line of descent that makes up the spine. This explains the appearance of A.^ in the 
general decomposition. 

Theorem 7.1 (Spine decomposition) We have the following spine decomposition for the 

additive branching-particle martingale: 

QHZ{t)\g^) - Y ^«e-/o"""(«^)«(«=)<^^C(^«) + e-^o '^^CW- 

Proof: In each sample tree one and only one of the particles alive at time t is the spine and 
therefore: 

Z{t)^ Y e-/o'"(^"(-))«(^"(^))d-C,(t), 
ueNt 

= e-/o ds^(i) + g-/„'m(X„(s))i?,(X„(s))ds^^(-^^^ 
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The other individuals {u S Nt,u ^ ^t} can be partitioned into subtrees created from fissions 
along the spine. That is, each node u in the spine (so u < ^t) has given birth at time Su to 
one offspring node uj (for some 1 < j < 1 + A„) that was chosen to continue the spine whilst 
the other Au individuals go off to make the subtrees (r, M)". Therefore, 

where for t>Su, 

Z^,iS^;t) := Y e-/s„"(^.(^))«(^.^(^))d^C.(t), 

is, conditional on Qoo, a P-martingale on the subtree {t,M)^, and therefore 

P{Zu,{Su;t)\goo) ^CiSu). 
Thus taking Q-conditional expectations of (21) gives 

Q^(^(t)|eoo) =e-/o ds^(^)^ ^ e-/o^"™(6WC.)d.p(^ Y Z„,(5„;i)|eoo), 

M<Ct J = l,...,l + Au 

= e-/o™(«^)«(«^)d«C(t)+ E e-^o""™(«=W«=)dM„C(5„), 

u<£,t 

which completes the proof. □ 

This representation was first used in the Lyons et al [.30, 27, 31] papers and has become the 
standard way to investigate the behaviour of Z under the measure Q. We observe that the two 
measures P and Q for the general model are equal when conditioned on Qoo since this factors 
out their differences in the spine diffusion , the family sizes born from the spine and the fission 
times on the spine. Therefore it follows that same argument as used above applies for P to give: 

Corollary 7.2 

P{Z{t)\goo) = J2 ^ne-^o'" d^C(5„)+e-/o d.^C(i)- 

8 Spine results 

Having covered the formal basis for our spine approach, we now present some new results that 
follow from our spine formulation: the Gibbs-Boltzmann weights, conditional expectations, and 
a simpler proof of the improved Many-to-One theorem. 

8.1 The Gibbs-Boltzmann weights of Q 

The Gibbs-Boltzmann weightings in branching processes are well-known, for example see Chau- 
vin and Rouault [.">] where they consider random measures on the boundary of the tree, and 
Harris [22] which gives convergence results for Gibbs-Boltzmann random measures. They have 
previously been considered via the individual terms of the additive martingale Z , but the fol- 
lowing theorem gives a new interpretation of these weightings in terms of the spine. We recall 
that 

Z{t)= Y e-/o™(^"(-))«(^"(-))d-C«(0- 
ueNt 
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Theorem 8.1 Let u G Q be a given and fixed label. Then 

Proof: Suppose u € ft, and F G Tt- We aim to show: 

y^l(e,=„) dQ(T,Af,0 = z{t) ^clQ(r,M,0. 

First of all we know that dQ/dP — (^{t) on JT^ and therefore, 

by definition of <^{t) at (15). The definition 4.3 of the measure P requires us to express the 
integrand with a representation like (7): 

and therefore 

LHS= / liuem)]J{l + A,)e- fo"^^^^(^^^^(^^<^^))<'Xuit)li^,=u) dP(r,M,C), 



by definition 4.3. We emphasize that now this is an integral taken with respect to the measure 
P over the cr-algebra Tt, and here we know that dP/dQ = 1/Z{t), so: 

LHS = y^l(„g^,)e-/o"(^"(^))«(^"(^))d.^C„(t) _L_dQ(r,Af,0, 

and the proof is concluded. □ 

The above result combines with the representation (7) to show how we take conditional 
expectations under the measure Q. 

Theorem 8.2 If f{t) e mTt, and f ^ J2ueNt /"(*)1(6=") ' ^ ^^^^ 

Qif(tm = E (22) 

Proof: It is clear that 

u<£Nt 

and the result follows from Theorem 8.1. □ 

A corollary to this exceptionally useful result also appears to go a long way towards obtaining 
the Kesten-Stigum result in more general models: 
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Corollary 8.3 If g{-) is a Borel function on J then 

J2 g(X„(i))e-/o ™(^^.(^))«(^"(^))d^CuW = Q(5(6)l^0 X Zit). (23) 

ueNt 

Proof: We can write g{^t) = X^ueWt ff(^i'(^))-'-(Ct = ^i^d now the result follows from the 
above corollary. □ 

The classical Kesten-Stigum theorems of [25, 24, 26] for multi-dimensional Galton- Watson pro- 
cesses give conditions under which an operation like the left-hand side of (23) converges as 
t — > c», and it is found that when it exists the limit is a multiple of the martingale limit Z(oo). 
Also see Lyons et al [27] for a recent proof of this based on other spine techniques. Our spine 
formulation apparently gives a previously unknown but simple meaning to this operation in 
terms of a conditional expectation and, as we hope to pursue in further work, in many cases we 
would intuitively expect that Q{g{S,t)\y^t) /Q{g{£,t)) 1 a-S., leading to alternative spine proofs 
of both Kesten-Stigum like theorems and Watanabe's theorem in the case of BBM. 



8.2 The Full Many-to-One Theorem 

An very useful tool in the study of branching processes is the Many-to-One result that enables 
expectations of sums over particles in the branching process to be calculated in terms of an 
expectation of a single particle. In the context of the finite-type branching diffusion of section 
2.1, the Many-to-One theorem would be stated as follows: 

Theorem 8.4 For any measurable function f : J ^ R we have 

p.,y(^^ j(X„(t),y„(t))) =p-^^(e/o«(')»)dsj(^^^^^)y 

ueNt 

Intuitively it is clear that the up- weighting term e^o ^(vs)<is incorporates the notion of the pop- 
ulation growing at an exponential rate, whilst the idea of /(^t, ?/t) being the 'typical' behaviour 
of f{Xu{t),Yu{t)) is also reasonable. 

Existing results tend to apply only to functions of the above form that depend only on the 
time-t location of the spine and existing proofs do not lend themselves to covering functions that 
depend on the entire path history of the spine up to time t. 

With the spine approach we have the benefit of being able to give a much less complicated 
proof of the stronger version that covers the most general path-dependent functions. 

Theorem 8.5 (Many-to-One) // f{t) G mJ^t has the representation 

fit) = E /«wi(6=«)' 

ueNt 

where fu{t) G rnTt, then 

P{Y. /u(t)e-^""'^"(^»^(^"(^»'^^C«(i)) -P(/(t)C(t)) =C(0)Q(/(0)- (24) 

In particular, if g{t) G rjiQt with g{t) = "Yliui^Nt 9u(t)'^{it=u) where gu{t) G mJ-f, then 
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Proof: Let f{t) be an ^t-measurable function with the given representation. We can use the 
tower property together with Theorem 8.2 to obtain 



We emphasize that this is a Q expectation of a ^(-measurable expression. From Theorem 6.4, 

Zit) 



dP 

and therefore we have 



Z(0)' 



uGNt 

On the other hand, since f{t) is J^j-measurable and 

m 



dP 

we have 



C(o)- 



t))=p(/(t)xCWC(o)-^). 

Trivially noting Z{Q) ~ <^{0) = C(0) as there is only one initial ancestor, we can combine these 
expressions to obtain (24). For the second part, given g{t) G mQt, we can define 

fit) := e^o d«g(t) X C{ty\ 

which is clearly -measurable and satisfies f(t) — X^uG-Wt /"('•)-'-(5t=") with 

When we use this f{t) in equation (24) and recall Lemma 5.7, that P := P\g^ from Definition 
4.6 and that P := from Definition 6.3, we arrive at the particular case given at (25) in the 

theorem. □ 

In the further special case in which g = g{£,t) for some Borel-measurable function g{-), the 
trivial representation 

5(6) = E 9{Xu{t)) 

ueNt 

leads immediately to the weaker version of the Many-to-One result that was utilised and proven, 
for example, in Harris and Williams [21] and Champneys et al [A] using resolvents and the 
Feynman-Kac formula, expressed in terms of our more general branching Markov process Xt : 

Corollary 8.6 If g{-) : J ^ M. is B -measurable then 

p(E3(^u(t)))=P(e/o«(«=)d«g(6 
u^Nt 
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